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The diffusion coefficient of the transcription factor Lad within living Escherichia coli has been 
measured directly by in vivo tracking to be D = 0.4 fim^/s. At this rate, simple models of diffusion 
lead to the expectation that Lad and other proteins will rapidly homogenize throughout the cell. 
Here, we test this expectation of spatial homogeneity by single-molecule visualization of Lad 
molecules non-specifically bound to DNA in fixed cells. Contrary to expectation, we find that the 
distribution depends on the spatial location of its encoding gene. We demonstrate that the spatial 
distribution of Lad is also determined by the local state of DNA compaction, and that E. coli can 
dynamically redistribute proteins by modifying the state of its nucleoid. Finally, we show that Lad 
inhomogeneity increases the strength with which targets located proximally to the Lad gene are 
regulated. We propose a model for intranucleoid diffusion that can reconcile these results with 
previous measurements of Lad diffusion, and we discuss the implications of these findings for gene 
regulation in bacteria and eukaryotes. 
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Introduction 

Regulation of gene expression by the binding and unbinding 
of transcription factor proteins (TFs) at specific regulatory sites 
in response to environmental stimuh is the fundamental 
mechanism organisms use to adapt to changing environments. 
An understanding of how TFs search for and find their binding 
targets is thus of fundamental importance if we are to 
understand the design and behavior of genetic circuits and 
their components (Gerland et aU 2002} . The seminal theore- 
tical and in vitro work of Berg, Winter, and von Hippel forms 
the basis for our current understanding of the TF search 
mechanism known as facilitated diffusion: TFs are thought to 
bind non-specifically and shde in one dimension along the 
DNA, in addition to three-dimensional diffusion throughout 
the cytoplasm (Berg et aU 1981; Winter and von Hippel, 1981; 
Winter et aU 1981) . Recent studies of the lac repressor, perhaps 
the best-studied transcription factor in E. coli, have revealed 
that diffusion of repressor in live cells is rapid, of order 
D-0.4|im^/s (Elf et aZ, 2007). At this rate, a single molecule 
travels the length of E. coli in 1-10 s, and simple models reveal 
that the distribution of repressor should be homogeneous 
throughout the DNA and cell. 

Here, we test this expectation by using high-throughput 
single-molecule microscopy to determine the average spatial 
distribution of lac repressor. Lad, in a strain deleted for the 
native lac operon. Because these strains lack a specific binding 



target, we can ask if the theoretical expectation of uniform 
distribution along the DNA and in the cytoplasm is met. 
Surprisingly, we find that the distribution is inhomogeneous 
and dependent on the location of the repressor gene, showing 
that our current understanding of the search process is far from 
complete. We further show that this inhomogeneity is the 
result of competition between non-specific DNA binding on 
the one hand and exclusion of the repressor from DNA on the 
other hand. Finally, we show that this inhomogeneity can be 
detected as stronger repression of targets located near the TF 
gene than would be expected from a homogenous repressor 
distribution, and we discuss the imphcations of these findings 
for gene regulation and the evolution of genome organization. 



Results 

To visualize the spatial distribution of non-specifically bound 
lac repressor molecules, we integrated a gene encoding Lad 
fused to the fluorescent reporter Venus at four different 
chromosomal locations in a strain where all lac repressor 
binding sites have been deleted (Kuhlman and Cox, 2010). 
These sites were near the rephcation origin of the circular 
genome, near the rephcation terminus, or halfway between the 
origin and terminus on the right or left halves of the 
chromosome, or rephchores (Supplementary Figure lA), and 
particular care was taken not to include any pseudooperators 
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overlapping the 3' end of either lad or venus (Oehler et aU 
1990, 1994; Miiller-Hill, 1998; Elf et al, 2007). These strains 
were grown to exponential steady state (Supplementary 
Figure 2), fixed with paraformaldehyde, and imaged to 
determine the spatial distribution of non-specifically bound 
Lacl-Venus molecules using total internal reflection fluores- 
cent (TIRF) microscopy, the distribution of DNA using staining 
with 4',6-diamidino-2-phenylindole (DAPI), the location of the 
lacl-venus integration sites determined by the fluorescent 
reporter-operator system (PROS; Joshi et aU 2011), and the 
corresponding lacl-venus mRNA distributions determined by 
fluorescence in situ hybridization (FISH; Raj et aU 2008; So 
et aU 2011). 

We took ^1000 images for each strain and growth 
condition. We then used an automated cell-finding algorithm 
to identify individual cells in each image, binned cells by size, 
and averaged the fluorescent intensity at each pixel to 
determine the average spatial distribution of each component 
(Figure 1). Due to hardware limitations, we were unable to 
dynamically switch between epifluorescent and TIRF illumi- 
nation, and this limitation precludes the simultaneous high- 
throughput measurement and direct correlation of gene 
location and protein distribution in the same cell. Conse- 
quently, imaging of the spatial distribution of each component 




Figure 1 Image averaging procedure. (A) A representative image field with 
lacl-venus integrated near the origin [atpl locus) imaged at x 133 magnification. 
(B) Individual cells are found using an automated algorithm, and coordinates 
are established at the cellular centroid to measure cellular dimensions and 
the location of each pixel. (C) Because of the narrow distribution of cell widths, 
correctly identified cells are identified according to correspondence with the 
appropriate width and binned according to the ratio of the length to width. 
Alternating gray and white bars indicate each bin. (D) Cells within each bin are 
rescaled to the size of the average cell within that bin and averaged together on 
a per-pixel basis and over all orientations. 
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(DNA, gene, mRNA, and protein) was performed in separate 
measurements. The absolute orientation of each cell is then 
ambiguous, and we additionally perform an average over all 
possible orientations, which introduces symmetries into the 
resulting average distributions. 



Repressor gene location influences TF 
distributions 

The average distributions of DNA, lacl-venus gene location, 
lacl-venus mRNA, and Lacl-Venus protein for cells that have 
just divided are shown in Figure 2. We find that the rephcation 
origin-proximal integration locus is located midcell and the 
terminus-proximal locus at a superposition of the poles and 
midcell (Bates and Kleckner, 2005; Wiggins etaU 2010). These 
results are in broad agreement with the existing literature (Niki 
et aU 2000; Wang et aU 2006; Montero Llopis et aU 2010; 
Wiggins et aZ, 2010} . FISH reveals lad mRNA to be localized at 
the gene loci for origin and terminus integrants (Montero 
Llopis et aU 2010). When the lad source gene is located on 
a medium copy number plasmid localized to the poles 
(Figure 2, second column), lacl-venus RNA shows some 
tendency to redistribute to other locations in the cell and 
Lacl-Venus is evenly distributed throughout the nucleoid. 
When the source is located within the chromosome, however, 
we find that the distributions depend on the position of the 
encoding gene on the chromosome, and correlate with the 
spatial distributions of the source genes. 

Recalling that these strains are deleted for specific Lad 
binding targets, interpretation of the Lacl-Venus distributions 
for origin and terminus integrants is straightforward: when the 
source is near the origin, the gene, mRNA, and protein map to 
that location. When the source gene is integrated at the 
terminus, the average gene location shifts toward the poles, 
and the Lad distribution changes correspondingly. Further- 
more, the Lad-Venus molecular phenotypes depend on the 
Lad DNA binding domain. Lad-Venus strains with insertions 
at the same locations but deleted for first 42 base pairs of the 
lad coding sequence (the 14 amino acid residues of the DNA 
binding domain; henceforth referred to as LacI42-Venus} show 
that, in the absence of the DNA binding domain, the Lad- 
Venus distributions at the origin and terminus alter dramati- 
cally (Figure 2, last row). 



DNA density affects TF distributions 

E. coli changes its morphology and the gross compaction state 
of its nucleoid in response to changing growth conditions. We 
assayed the spatial distribution of Lad-Venus in a variety of 
growth states to determine whether these changes in the 
intracellular environment affect the steady-state distribution 
of Lad-Venus protein. The results for cells containing an equal 
total amount of DNA in different growth states are shown in 
Figures 3 and 4. When bulk growth is arrested in stationary 
phase cells generally contain a single chromosome, are morpho- 
logically small, and the DNA is densely packed (Figure 3, 
second column; Supplementary Figure 3). As the growth rate 
increases, cells become larger and the DNA becomes more 
loosely packed (Figure 3, third column; Supplementary 
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Figure 2 Average spatial distribution of lacl-venus expression. Cells were grown in M63 minimal medium + 0.5% glycerol before fixation. Averages are for cells of 
1 .9-2.2 |im in length, the bin identified in Figure 1D. Averages are also taken over all cell orientations, resulting in the symmetry of each plot. Raw images of combined 
transmitted and fluorescence channels are shown in the first column for DNA ( x 400, DAP! staining; row 1), gene locus ( x 400, PROS; row 2), mRNA ( x 400, PISH; 
row 3), Lacl-Venus protein ( x 400, TIRP microscopy, row 4), and Lacl-Venus protein without DNA binding domain ( x 400, TIRP microscopy, row 5). Representative 
Lacl-Venus images are at x 133magnification for visual clarity. Remaining columns show average distributions for each component with the lacl-venus source gene 
expressed from the indicated location. Pluorescent intensity is normalized to the mean fluorescence within the cell. The quantitative color scale is for Lacl-Venus only; 
other averages are qualitatively scaled for comparison. The essQ locus was the Lacl-Venus source for terminus, but the gene location shown is for the adjacent nth 
terminal locus for clarity (Kuhlman and Cox, 2010); essQ localization is identical but less clear due to reduced binding at that locus. 



Figure 3). As cells reach their maximum growth rate of 
^20 min/doubling they contain at least two or more complete 
sister chromatids (Figure 4, third column; Supplementary 
Figure 3), and the trend of DNA decondensation in fast 
growing cells continues compared with slower growing cells 
late in the hfe cycle which also contain two complete sister 
chromatids (Figure 4, second column). 

It is possible that the more diffuse distribution of DNA in 
fast growing cells shown in Figure 4 is an artifact of image 
averaging: if the spatial positioning of condensed DNA within 
the larger volume of fast growing cells is highly variable, yet 
dense, averaging will make the resulting distribution appears 
less dense than each individual nucleoid. We therefore also 
quantified the compaction density of nucleoids in individual 
cells in each growth state by measuring the degree of geometric 
curvature of the DAPI fluorescent surface representing each 
nucleoid (see Supplementary information — Quantification of 
DNA Compaction) . The results of this single-cell analysis are 
shown in Supplementary Figure 4, and demonstrate that the 
compaction density of DNA decreases monotonically as a 
function of growth rate when measured in individual cells. 

Measurement of Lacl-Venus distributions in each of these 
growth states reveals that the distributions are heavily 
dependent on the density of the DNA. When the DNA is most 
highly condensed in stationary phase, LacI42-Venus appears to 
be generally excluded from the DNA and localized toward the 
ends of the cell regardless of the source location. Conversely, 
the pattern of localization of Lacl-Venus non-specifically 
bound to DNA correlates closely with the location of the 
source gene (Figure 3, compare first and second columns). 
Similarly, the distribution of Lacl-Venus in exponentially slow 
growing cells containing a single chromosome also correlates 
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closely with the distribution of its encoding gene (Figure 3, 
compare first and third columns). However, LacI42-Venus is 
no longer highly localized toward the ends of the cell and is 
instead more evenly distributed, although there is a weak 
general tendency to avoid the central nuceloid-occupied area 
of the cell. These subtle differences in distribution may be a 
consequence of different initial conditions due to different 
integration locations. 

As the cells grow and repUcate their DNA, the resulting sister 
chromatids segregate and gene loci move around within the 
cell. We show in Figure 4 the distribution of Lacl-Venus in slow 
growing (left) and fast growing (right) cells containing two 
complete sister chromatids. We first notice the continued 
apparent tendency of LacI42 -Venus to avoid the central spaces 
occupied by the nucleoid. For the slow growing midrephchore 
integrants, the intact Lacl-Venus distributions correlate closely 
with the distribution of the corresponding gene. For the origin 
and terminus integrants, however, the correlation is less clear. 
Both distributions show hysteresis: i.e., the distribution of 
Lacl-Venus in older cells depends upon the history of the 
location of the gene in younger cells, in the following sense: 
Before rephcation, the origin locus produces Lacl-Venus at 
midcell. After rephcation and segregation, Lacl-Venus pools in 
the inter-chromatid space near where the origin was originally, 
as judged by Figures 4 and 5. Similarly, after the terminus 
relocates to midcell after rephcation and segregation, there 
remains a pool of Lacl-Venus further out toward the ends of the 
cell. We propose that this tendency of Lacl-Venus to pool in the 
nearest available space that contains less DNA is also a result 
of a tendency toward nucleoid exclusion of Lacl-Venus 
molecules, and may be related to previously observed 
asymmetry of the cell poles (Rang Camilla et aU 2011). 
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Figure 3 Average TF distributions as a function of growth state: one chromatid. Average DNA (first row) and Lacl-Venus distributions with and without the DNA binding 
domain (subsequent groups of two rows, respectively) are shown as a function of lacl-venus gene integration location (rows) for stationary phase cells (left) and slow 
growing exponential phase cells containing one chromatid (right). Stationary phase results are from cells grown overnight in M63 minimal medium + 0.5% glycerol. Lacl- 
Venus distributions were obtained by excitation with a 488-nm wavelength laser except in the case of the midreplichore integrants, where a 514-nm laser was used for 
excitation to increase the signal-to-noise ratio. The DAPI channel is shown as green in the first row to improve sensitivity to the eye. Average DNA content for exponential 
growth is scaled to be directly comparable to Figures 4 and 5 (see Supplementary Figure 3C and D and Kubitschek, 1974). Distributions of gene location in the first 
column are from cells grown in M63 + 0.5% glycerol that are 1 .9-2.2 |im in length; gene distributions in stationary phase remain similar and are compared directly in 
Supplementary Figure 5A. 



Finally, in fast growth conditions with less densely packed 
DNA the distributions of Lacl-Venus do not obviously correlate 
with the distributions of the encoding genes, nor is it obvious 
that the LacI42 -Venus shows any tendency toward nucleoid 
exclusion. Furthermore, the overall absolute correlation of the 
Lacl-Venus distribution with the DNA distribution appears 
significantly weaker in the fast growth state than in the slow 
growth state. Compare, e.g., the correlation of the distribution 
of Lacl-Venus with the DNA distributions when Lacl-Venus is 
expressed at high levels from an extrachromosomal plasmid in 
the last row in Figure 4. 

To further explore the influence of DNA density on the 
distribution of proteins, the compaction state of DNA can be 
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manipulated directly by adding drugs that disrupt transcrip- 
tion and translation (Cabrera et aU 2009). Chloramphenicol 
binds to the ribosome and stops peptide synthesis, and also 
leads to the condensation of DNA. In Figure 5A, we show the 
change in average nucleoid density and the resulting 
redistribution of Lacl-Venus in exponentially slow growing 
cells late in the hfe cycle after DNA rephcation following 
the addition of 200|ig/ml chloramphenicol; cross-sections 
through the longitudinal axis of the distributions at time t = 0 
and t=10min are shown in Figure 5B, which show the 
expected bilobed nucleoid structure after chromosome repH- 
cation and segregation. We quantify the amount of mass 
displacement at each time point, D(y , by taking the difference 
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Figure 4 Average TF distributions as a function of growth state: two chromatids. Average DNA (first row) and Lacl-Venus distributions with and without the DNA 
binding domain (subsequent groups of two rows, respectively) are shown as a function of lacl-venus gene integration location (rows) for exponentially slow growing cells 
in M63 + 0.5% glycerol (left) and exponentially fast growing cells containing two chromatids (right). Fast growth in RDM + 0.5% glucose yields a doubling time of 
22 ± 2 min. Stationary phase results are from cells grown overnight in M63 minimal medium + 0.5% glycerol. Lacl-Venus distributions of exponentially slow growing 
origin and terminus integrants we obtained by Venus excitation with a 488-nm wavelength laser; all other conditions were obtained with a 514-nm laser to increase the 
signal-to-noise ratio. Average DNA content in each case is scaled relative to slow growing cells containing one chromatid in Figure 4. Distributions of gene location shown 
in the first column are from growth in M63 + 0.5% glycerol that are 4.2-4.5 |im in length; gene distributions in exponential fast growth are similar and are compared 
directly in Supplementary Figure 5B. 



between the initial distribution at time to, F(x,y, to), and the 
distribution at each subsequent time point t^, F(x,y, t^), 
according to 

D{ti)=J2\P(^hyk,ti)-F{Xj,yj,,to)\ (1) 

The percentage of maximum displacement of both DNA 
(blue circles) and Lacl-Venus (red circles) as a function of time 



is shown in Figure 5C, in which we simultaneously fit both 
data sets to a model assuming a constant rate of displacement 
(3, i.e., D{t) = 1 —e~^^. The simultaneous fit of both data sets to 
this model is shown as the black curve in Figure 5C, and we 
find a rate of mass displacement of (3 = 0.60/min. Since areas 
of high Lacl-Venus density correspond inversely to areas of 
high DNA density after displacement, and the displacement of 
DNA and Lacl-Venus occurs at comparable rates, the most 



© 2012 EMBO and Macmillan Publishers Limited 



Molecular Systems Biology 2012 5 



Transcription factor distributions in £. coli 
TE Kuhlman and EC Cox 



A 

t=o 

f=20s 
t= 1 min 
t= 2.5 min 
t= 5 min 
t= 10 min 



DNA distribution Lacl-Venus distribution 

c- - ^ c 



Add 200 jug/ml chloramphenicol 

f ^ (—ir 



3 

3 



c )r 



2x 

1.5x 

IVIean 

0.67X 

0.5x 



— DNA 

— Lac! 




-0.5 -0.25 0 0.25 
Longitudinal position 



0.5 




• DNA 

• Lac! 



5 10 
Time (min) 



15 



Figure 5 Lad redistribution as a result of nuceloid condensation. (A) The steady-state distribution of DNA and Lacl-Venus in cells with lacl-venus integrated near the 
origin (atpl locus) and grown in M63 + 0.5% glycerol, cell length of 4.2-4.5 |im, are shown at f = 0. After this sample was taken, 200 |ig/ml chloramphenicol was added 
directly to the culture and samples were withdrawn and fixed at the indicated times. Note that the color scale has been changed somewhat from Figure 4 to accommodate 
the condensed DNA. (B) Cross-sections along the longitudinal axis of the DNA (blue) and Lacl-Venus (red) distributions at time t=0 (top) and f = 10 min (bottom). Lines 
are a moving average of six adjacent points. (C) The rate of mass displacement. Displacement is quantified as the total difference between each distribution and the initial 
distribution at time t= 0. The black line corresponds to a simultaneous fit of both data sets to the equation D{t) = l-e~^^. The equivalent result for Lacl42-Venus is 
shown in Supplementary Figure 6. Source data is available for this figure in the Supplementary Information. 



parsimonious explanation for these data is that the redistribu- 
tion of Lacl-Venus protein is driven by modifications of the 
DNA condensation state (see also Figures 3 and 4 and 
Supplementary Figure 6 for the equivalent result for LacI42- 
Venus and Supplementary Figure 7B for similar results 
showing a tendency for nucleoid exclusion in response to 
serine hydroxamate) . 

Nucleoid exclusion has previously been observed for large 
aggregates of proteins (Ebersbach et aU 2008; Winkler et aU 
2010), and it is thought that exclusion may be driven by 
aggregation combined with entropic mechanisms (Saberi and 
Emberly, 2010). However, the distributions reported here are 
not due to Lacl-Venus aggregation: expression levels from 
chromosomal sources are low (^1-10 molecules per cell; 
Gilbert and Muller-Hill, 1966), and bright spots in the images 
correspond to individual Lacl-Venus dimers as judged by 
bleaching and blinking kinetics of fluorescent spots in 
individual fixed and living cells (Supplementary Figure 8). 
Furthermore, when a specific Lad binding target is present in 
the genome, Lacl-Venus binds at the predicted location 
(Supplementary Figure 5D}, the expected behavior for fully 
soluble and functional Lad dimers. The specific binding 
regions used in Supplementary Figure 5D are variants of the 
Puac-oi promoter (Lutz and Bujard, 1997; Kuhlman and Cox, 
2010). Since this promoter binds at most two dimers and 
is easily distinguished using our method, this is also a 
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strong indication that we are imaging individual functional 
molecules. 

The patterns of localization described here do not appear to 
be artifacts of fixation. We show in Supplementary Figure 7A 
images of live stationary phase cells expressing LacI42-Venus 
from both plasmid and chromosomal sources, showing 
the same pattern of localization toward the ends of the 
cells reported in Figure 3, and in Figure 5 in response to 
chloramphenicol. We also show in Supplementary Movie 1 a 
movie of those cells shown in Supplementary Figure 7A with a 
chromosomal LacI42-Venus source that shows that the motion 
of LacI42-Venus molecules is severely slowed and spatially 
localized toward the ends of the cell. The spots shown in this 
movie are single LacI42-Venus molecules as determined by 
blinking and bleaching kinetics (Supplementary Figure 8B} . 



TF distributions affect gene regulation 

Because the strength with which bacterial promoters are 
regulated is a function of the concentration of the relevant 
transcription factors (Buchler etaU 2003; Bintu etaU 2005a, b), 
we next attempt to detect the local enhancement of Lad 
concentration near the lad gene through the repression 
strength of a Lad-regulated target. We constructed three 
famihes of 14 strains where we inserted into previously 
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Figure 6 Repression strength as a function of intergenic distance. Repression strength as a function of genomic distance from the lacl-venus gene, where lacl-venus 
is near (A) origin (atpl locus), (B) mid-replichore (ybbD locus), or (C) terminus (nth locus). Red: slow growth (x = 11 0 ± 1 2 min); yellow: medium growth (x = 68 ± 5 min); 
green: fast growth (x = 22 ± 2 min); black: M63 + 0.5% glycerol + 2 mM IPTG control; circles are the mean of four measurements, error bars are the s.d. Dashed lines 
indicate best fits assuming spatially homogeneous distribution of repressor. Solid lines indicate best fits assuming exponential Lacl-Venus inhomogeneity. Source data is 
available for this figure in the Supplementary Information. 



constructed lacl-venus strains a cassette bearing a Lacl- 
regulated lacZ target at regular intervals around the genome 
(Supplementary Figure 1). In Figure 6A-C, the strength of 
repression is shown as a function of intergenic distance for 
each family of strains. Colors indicate the growth rate of the 
culture: slow (red), medium (yellow); fast (green); slow 
+ 2mM IPTG control (black). In each plot, the x axis is 
adjusted so that the location of the repressor gene Ues at the 
center of the x axis (black vertical dashed lines). In all cases, 
the strength of repression increases as the target is moved 
toward the terminus and decreases as the target is moved 
toward the origin. In each case there is a small but reproducible 
1.2-2 X spike in repression when the target is located 
immediately proximal (~ 200 bp downstream) to the repressor 
gene. We encounter little interference from position-specific 
effects, with a few exceptions: in all cases, the target inserted 
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near the terminus at the essQ locus (see Supplementary 
Figure 1) remained unregulated. Additionally, when lacl-venus 
is inserted near the origin (Figure 6A) the repression strengths 
at essQ and nth near the terminus are significantly weaker than 
might otherwise be expected. This may be due to their location 
within the less densely packed chromosomal crossing region 
(Wiggins et aU 2010) . These data were excluded from the best 
fit lines in Figure 6A-C. 



Analysis and modeling: repression strength is 
inconsistent with a homogeneous TF distribution 

To understand these trends, it is necessary to model the 
response of the reporting promoter to its regulating TF. In 
steady state, the binding and unbinding of an operator by a TF 
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is described by the simple equilibrium equation 

[TF] + [O] ^ [TF - O] 

k 



K-- 



[TF][0] 
[TF - O] ' 



(2) 



Consequently, at equilibrium any effects of TF-target 
intergenic spacing can be exerted only through the TF 
concentration, [TF] , the concentration of the target operator, 
[0], and/or the equilibrium dissociation constant K. 

In Equation (2) the operator concentration, [0], changes due 
to the absolute location of genes relative to the rephcation 
origin. Because rephcation initiation and elongation occur 
from a single origin in E. colU the period-averaged average gene 
copy number varies as a function of position according to 



(3) 



where C is the time to complete rephcation (40-70 min), D is 
the time lag between completion of rephcation and ceh 
division (22-30 min), x is the doubling time of the culture, 
and X is the fractional location of the gene between the 
replication origin and terminus (origin at x = 0 and terminus at 
X = 1} (Cooper and Helmstetter, 1968; Helmstetter and Cooper, 
1968; Bremer and Churchward, 1977; Bremer and Dennis, 
1996). Because at the fastest growth rates reinitiation of 
rephcation occurs before the first termination, on average 
there can be as many as eight copies of origin-proximal genes 
for every one terminus-proximal gene. Increased intergenic 
spacing therefore leads to an imbalance of relative TF-target 
gene copy numbers if TF and target genes are located at 
disparate locations on the chromosome. 

To describe the repression data, we combine Equations (2) 
and (3) with a statistical mechanical model of transcriptional 
regulation of the promoter expressing lacZ (Buchler et aU 
2003; Bintu et aU 2005a, b). For the promoter used in these 
experiments, we find that the repression strength, p, can be 
predicted as 



[TF] 
K 



(4) 



(see Supplementary information — Promoter Modeling and 
Supplementary Figure 9} . This model is used to fit the data in 
Figure 6. 

We first fit the data assuming a spatially homogeneous 
distribution of repressor (dashed colored lines in Figure 5 A-C) . 
The fits recapitulate the general trends: as the target is moved 
toward the terminus the ratio of average TF to target gene 
number increases, leading to increased repression strength. 
Conversely, when the target is moved toward the origin, the 
ratio of average TF to target gene number decreases, resulting 
in decreased repression strength. 

While these fits display the correct general trends, in ah 
cases the strength of repression near the TF gene is ~1.5 x 
stronger than predicted by the nuh model. We note the 
similarity of this enhancement to the observed enhancement 
of Lacl-Venus concentration summarized in Figure 2. Accord- 
ing to Equation (4), p increases linearly with [TF] for this 
promoter; therefore, increases in TF concentration should 
result in corresponding enhancements in repression strength. 



Analysis and modeling: repression strength is 
consistent with local inhomogeneity 

Keeping the microscopy results in mind, we next incorporate a 
local TF concentration gradient into our model. Since we 
assume that the gradient is generated by diffusion, and a 
steady-state diffusion gradient with constant linear degrada- 
tion is exponential regardless of the details of the diffusion 
process (dimensionality, diffusion versus anomalous diffu- 
sion, etc.) (Hornung et aU 2005), we postulate a local 
exponential gradient that decays to a non-zero background 
concentration of repressor: 



[TF(x)] = [TF][l+yexp(- |x-Xo |A)] 



(5) 



where Xq is the location of the TF gene and x is the location in 
space where the concentration is being measured. The first 
term describes a constant background distribution of TF, while 
the second term describes a local exponential concentration 
gradient characterized by two additional parameters: a length 
scale X and a local enhancement factor y (we consider the 
origins of such a function in Discussion). 

Combining Equation (5) into our model and fitting the 
repression strength data yields the sohd lines in Figure 6 and 
the fit parameters in Supplementary Table 2. In contrast to 
the fits assuming homogeneous repressor concentration, this 
model recapitulates the trends in the data as weh as the local 
enhancement of repression near the TF gene. Given that 
Koid -0.1 nM (Oehler et al 1994, 2006), the extracted TF 
concentrations are in the range of 0.1-14 nM (Gilbert and 
Muher-Hih, 1966); the smahest values can possibly be 
explained as a consequence of increasing K due to growth 
effects (see Supplementary information — Dilution and Growth 
Effects and Supplementary Figure 10) . 

Discussion 

The origins of TF inhomogeneity 

That steady-state TF concentration inhomogeneity can be 
observed at ah is surprising given the smah size of the ceh and 
the time scales involved. In E. colU TFs are produced from a 
localized source due to the persistent localization of mRNAs 
observed previously (Montero Llopis et aZ, 2010) and in this 
report and the spatial coupling of transcription and translation. 
Naively, as repressor diffuses away from this source, the shape 
of the resulting gradient should be determined by the ratio of 
the diffusion constant to the degradation rate of the TF (Ibanes 
and Izpisua Belmonte, 2008). The in vivo diffusion constant of 
Lad- Venus has been measured by live tracking of individual 
molecules to be ^0.4 |im^/s (Elf et aU 2007), and the average 
time required for Lacl-Venus to diffuse from one end of the ceh 
to the other is therefore on the order of 1-10 s. The proteolytic 
degradation rate of Lad, on the other hand, is thought to be 
sufficiently small that dilution due to growth is generally 
assumed to be the primary mechanism for depletion (e.g., 
Narang and Pilyugin, 2008). That the dilution time scale is a 
minimum of 100-1000 x slower than the diffusion time scale 
might lead one to expect that significant steady-state intracel- 
lular TF concentration inhomogeneity is impossible and that 
the intracellular TF concentration wih rapidly homogenize. 
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From a physical point of view, the observed inhomogeneity 
of TFs can be explained only by a limited variety of 
mechanisms, and all of these mechanisms may be at play to 
some degree. One possibility is that the system could be 
heterogeneous as a transient state. This explanation appears to 
be ruled out as such transient states should be short lived 
based upon the known rates of diffusion observed by live 
tracking of individual molecules (Elowitz et al, 1999; Elf et aU 
2007) . However, since live tracking requires the maturation of 
fluorophores before they can be imaged (~ 10 min for Venus; 
Nagai et aU 2002; Elf et aU 2007), it is possible that there exist 
intranucleoid states immediately after TP production which 
diffuse anomalously or more slowly than heretofore reported. 

Another possibility is spatial heterogeneity of diffusion 
coefficients. However, this mechanism can only maintain 
inhomogeneity if there is localized production of TFs 
combined with degradation (Ibanes and Izpisua Belmonte, 
2008; Warthck et al 2009). TFs are produced locally as a 
consequence of the persistent localization of mRNAs at the site 
of transcription (Montero Llopis et aU 2010} and the spatial 
coupling of transcription and translation. Under these condi- 
tions, the steady state resulting from localized production 
balanced with degradation, dilution, or nucleoid exclusion can 
lead to TF inhomogeneity within the nucleoid. 

Finally, spatial heterogeneity can be driven by spatial 
variation in energetic potentials for DNA, mRNA, and TFs, 
resulting in heterogeneity even at thermodynamic equili- 
brium. These spatial variations are almost certainly important 
here due to, e.g., the energetic favorability of TF-DNA binding 
and the observed exclusion of proteins from highly compacted 
DNA, perhaps driven by entropic mechanisms (Saberi and 
Emberly, 2010). 

We propose that the observed inhomogeneity is the result of 
competing forces of binding to DNA on one hand and 
exclusion from DNA on the other hand, and that the relative 
magnitude of these forces is dependent upon the local DNA 
compaction density. That the magnitude of the exclusion force 
is an increasing function of DNA density can be observed 
straightforwardly by examining the degree of exclusion of 
LacI42-Venus as a function of growth rate. With fast growth 
and low DNA density (last column in Figure 4) , mutant LacI42- 
Venus is not obviously excluded from the DNA volume. 
However, as growth rate decreases and compaction increases, 
exclusion increases until the mutant LacI42-Venus is more 
highly localized toward the ends of the cells in stationary 
phase (second column in Figure 3). 

These competing forces will influence the diffusive kinetics 
of TFs within the nucleoid. In steady state, we propose to 
describe intranucleoid diffusion by a mean-field model in 
which TFs can be separated into two sub-populations: those 
TFs confined within the nucleoid, [TFJ, and those TFs 
undergoing 3D diffusion in the cytoplasm, [TF3]: 

6[TF^(x,0] +DiV2[TFi] - {k, + P)[TFi] + ^[TFs] 

^'^^^f '^' -oc(l-f)5(f-fo) +D3V^[TF3] 
-(fc2 + P)[TF3] + fci[TFi] 

(6) 
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where a is the rate of TF production, p is the rate constant for 
degradation or dilution, Di and D3 are the intranucleoid and 
extranucleoid diffusion coefficients, respectively, ki and /C2 are 
nucleoid exclusion and capture rate constants, respectively, 
and f is the fraction of TFs initially confined to the nucleoid 
after translation. We postulate that hops through 3D space of 
short duration within the nucleoid can be coarse grained along 
with true ID shding along DNA into effective ID diffusion 
along the nucleoid. Only when TFs escape into the cytoplasm 
for an extended 3D excursion they are considered to be 
undergoing true 3D diffusion, and this escape is assumed to 
occur with linear rate constant ki as a result of nucleoid 
exclusion forces. In this view, the escape rate constant ki 
serves as the degradation rate constant of the nucleoid- 
confined sub-population. After escape, TFs diffusing in 3D 
rapidly form a homogeneous background pool and can be 
recaptured by the nucleoid with linear rate constant /C2. 
A cartoon illustrating this argument is shown in Figure 7. The 
steady-state solution of a simphfied ID version of this model 
yields an enhancement of TF concentration around the 
encoding gene located at a position Xq (Kuhlman and Cox, in 
preparation) : 

[TF(x)] = [TF] [1 + y exp( - I X - Xq I A)] (7) 

where x is the longitudinal distance from the source gene and 
we define the constants 

[TF] = -[TF3], y = ^^^y^^^^y 

X = VDi/(/ci + P) 

The solution (7) is precisely the functional form postulated 
in Equation (5), and has two terms: a constant background 
concentration [TF], and a local enhancement of concentration 
around the encoding gene whose magnitude relative to the 
background is characterized by the parameter y. This 
parameter is directly proportional to the fraction of TFs 
immediately captured by the nucleoid after translation, f. 
Thus, when the source of production is located within 
the nucleoid, f>0, yielding non-zero y and localized 
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Figure 7 A model of intranucleoid diffusion. TFs are produced at the site of the 
source gene within the nucleoid. TFs are initially confined to the nucleoid and 
diffuse via 1D sliding along the chromosome combined with short 3D hops 
between domains, which we coarse grain together as effective 1 D diffusion along 
the entire nucleoid. TFs are excluded from the nucleoid with rate constant k^, 
where they quickly homogenize via rapid 3D diffusion to form a background pool 
of repressor. TFs can be recaptured into the nucleoid rate constant /C2, and are 
diluted with rate constant p. 
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inhomogeneity. When TFs are produced from an extrachro- 
mosomal source such as a plasmid directly into the cytoplasm, 
f= 0, and this results in homogeneous distribution throughout 
the DNA, as observed experimentally (Figure 2} . 

After production, it takes ~ 10 min for the Venus fluorophore 
to mature (Nagai et al 2002; Elf et al 2007), a process that 
occurs even after fixation by paraformaldehyde (Little et aU 
2011} . Thus, we beheve that our results remain consistent with 
previous live-tracking measurements of rapid intracellular TP 
diffusion (Elf et aU 2007) because live tracking requires the 
fluorophore to mature before it can be visualized. The 
nucleoid bound state leading to inhomogeneity, [TFi], may 
decay before maturation, and we argue that the cytoplasmic 
fraction [TF3], interacting with the nucleoid, may primarily be 
what was imaged in live-tracking experiments. 

The observations reported in this manuscript are in accord 
with previous reports regarding d5-acting proteins, such as the 
}^-phage Q protein (Echols et aU 1976), the bacteriophage 
9X174 A protein (Francke and Ray, 1972), and the ISIO 
transposase of the TnlO system (Morisato et aU 1983; Jain and 
Kleckner, 1993). These proteins have been shown to prefer- 
entially act upon closely located targets (Morisato et aU 1983; 
Jain and Kleckner, 1993), or to act only while synthesis is in 
progress, but cease to function later, even though intact and 
functional protein is still present (Echols et aU 1976). They are 
also generally DNA binding proteins, and their initial 
mechanism of action is similar to that of TFs such as Lad: 
the recognition of, and binding to, some cognate DNA 
sequence. This binding and unbinding will be governed by 
an equilibrium analogous to Equation (2), and we therefore 
suggest that the observed preferential d5-action may also be a 
consequence of protein inhomogeneity and concentration near 
the source gene. Furthermore, the activity of the ISIO 
transposase on transposons has been assayed as a function 
of the transposase/transposon intergenic distance (Morisato 
et al, 1983; Jain and Kleckner, 1993), and from an analysis of 
this data we find that this activity decays exponentially with 
distance, similar to that reported here with Lac repressor and 
modeled in Equations (5) and (7) (see Supplementary 
Figure 11). 

We also expect that the same rules of intranuceloid versus 
cytoplasmic production should influence the diffusive 
kinetics and spatial distribution of RNAs, and the results 
presented here may reconcile the apparent discrepancy 
between the observation of the rapid diffusion rates of RNAs 
(Golding and Cox, 2004; Golding et aU 2005) contrasted with 
the apparent colocalization of RNAs with chromosomally 
located source genes reported here and by Montero Llopis et al 
(2010) . The rapid diffusion rates for RNAs observed by Golding 
and Cox resulted from quantitative measurement of RNAs 
produced from polar-localized bacterial artificial chromo- 
somes; i.e., the RNAs were transcribed directly into the 
cytoplasm (f=0. Equation (6)). This is similar to the 
extrachromosomal plasmid reported here where it appears 
that the RNAs it produces have an enhanced tendency to 
redistribute compared with chromosomally produced RNAs 
(Figure 2; Supplementary Figure 5B). We might also expect 
that the diffusive kinetics of RNAs transcribed within the 
nucleoid are significantly different from those transcribed in 
the cytoplasm, and that the more crowded intranuceloid 
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environment will lead to slower rates of diffusion and a higher 
degree of RNA/gene colocalization. 

Dynamic nucleoid organization as a regulatory 
strategy 

The observation that E. coli can dynamically alter the 
distribution of TFs and other intracellular proteins by changing 
the compaction state of its DNA may provide a mechanism by 
which E. coli can shift between global gene expression 
patterns. It has been previously demonstrated that E. coli 
generally reduces the expression of genes near the origin of 
rephcation and increases the expression of genes near the 
terminus upon DNA condensation (Berger et aU 2010) . We note 
that since origin proximal genes lie within the body of the 
nucleoid and terminal-proximal genes he within the loosely 
packed chromosomal crossing region which is exposed to the 
cytoplasm (Wiggins et aU 2010), these changes in gene 
expression correlate with the spatial changes in TF and protein 
distribution we observe during chromosomal condensation. 
Furthermore, we also observe reorganization and redistribu- 
tion of DNA and protein upon exposure to serine hydroxamate, 
which induces the stringent response via amino-acid starva- 
tion (Ferullo and Lovett, 2008; Supplementary Figure 7B). We 
therefore suggest that protein redistribution driven by DNA 
condensation may be a general strategy for quickly altering 
global gene expression patterns in response to environmental 
stimuh, and, in particular, for sweeping non-specifically bound 
TFs from the DNA. 



Implications for bacterial genome organization 

There is mounting evidence that the spatial organization of the 
prokaryotic genome is determined by the connectivity and co- 
regulation of transcriptional regulatory networks, rather than 
through selection via horizontal gene transfer (Pal and Hurst, 
2004; Warren and ten Wolde, 2004; Hershberg etaU 2005; Price 
et aU 2005; Kolesov et al 2007; Wunderhch and Mirny, 2008; 
Janga et al, 2009; Yin et al, 2010). This work provides 
experimental evidence for a possible mechanism driving this 
organization. Because of the inhomogeneity of the intracel- 
lular TF concentration, gene targets are regulated more 
strongly and more quickly when located closer to their 
regulatory TF genes (Kolesov et al, 2007; Wunderhch and 
Mirny, 2008; Mirny et al, 2009). Because spatial distance is 
directly proportional to genomic distance (Wiggins et al, 2010) , 
these regulatory benefits may provide selective pressure 
driving the close genomic organization of co-regulated genes 
on the bacterial chromosome. 



Implications for eukaryotes 

Because eukaryotic TF proteins are not produced locally, 
transcriptional regulation of eukaryotic genes by TFs is not 
subject to the same effects of TF gene localization in bacteria 
that we focus upon here. However, the sites of nuclear re- 
importation serve as localized sources and may result in 
intranuclear TF inhomogeneity. This naive preconception may 
anticipate recent studies that reveal that some regulated genes 

© 2012 EMBO and Macmillan Publishers Limited 



Transcription factor distributions in £. coli 
TE Kuhlman and EC Cox 



in yeast actively reorganize to the nuclear periphery and 
associate with nuclear pores upon exposure to inducing 
conditions (Ahmed and Brickner, 2007; Brickner et aZ, 2007; 
Ahmed et al 2010). 

Materials and methods 
Strains and plasmids 

All strains used in this study are derivatives of E. coli K-12 MG1655 
Alac (Kuhlman and Cox, 2010), in which the entire lac operon has been 
deleted from the N-terminus of lad to the C terminus of lacA using 
the method of Datsenko and Wanner (2000). Strains with 
ViacrlacI-venus::Tl (Nagai et al, 2002; Elf et al, 2007; Kuhlman and 
Cox, 2010) integrated at specific locations in the chromosome were 
constructed using Landing Pad technology as described previously 
(Kuhlman and Cox, 2010) and in Supplementary information. Required 
permissions for use of venns were obtained from Dr Atsushi Miyawaki 
at the Riken Brain Science Institute. 



Media and growth conditions 

To obtain various doubling times, cells were grown at 37°C in a shaking 
water bath with M63 medium (100 mM KH2PO4, 15 mM (NH4)2S04, 
1 .7 |iM FeS04, 1 mM MgS04) + 0.5 % glycerol, or Rich Defined Medium 
(Teknova) + 0.5 % glucose without antibiotics. We estimate that at the 
time of harvest cells had been growing in exponential steady state for 
~10 generations. Representative growth curves for each growth 
condition are shown in Supplementary Figure 2. A detailed protocol for 
cell growth can be found in Supplementary Information. For fixation, 
an equal volume of freshly prepared and filtered 5 % paraformalde- 
hyde in phosphate-buffered saline (PBS) was added directly to each 
culture. The resulting solution was allowed to continue shaking at 
37 °C for lOmin and was then placed on ice for 30min. Cells were 
washed three times via centrifugation and resuspension in 1 ml 
filtered, ice-cold PBS. 



Fluorescent reporter-operator system 

Gene locations were determined using the FROS performed as 
described in Joshi et al (2011), the components of which were the 
kind gift of David Bates. Integrations were made at each site consisting 
of an array of 240 operators for tet repressor, TetR, using the Landing 
Pad technology. After growth to steady state as described above, 0.01 % 
L-arabinose was added to each culture 1 h before fixation to induce 
expression of TetR tagged with fluorescent EYFP in trans from the 
plasmid pBH74 (Joshi et al, 2011). Cells were then fixed and processed 
as above. 



Fluorescent in situ hybridization 

mRNA locations were determined by single-molecule FISH, performed 
as described in So et al (2011) using an array of 48 bp Stellaris FISH 
probes covering the coding sequence of lad and labeled with CAL 
Fluor Red 590 purchased from Biosearch Technologies. Control 
experiments using similarly prepared probes for the coding strand 
of DNA yielded no bright spots, demonstrating specific labeling of 
mRNA. 

After fixation, cells in PBS were spread onto poly-L-lysine coated #1 
22 mm x 60 mm coverslips and allowed to adhere for 30min. Cover- 
slips were rinsed with PBS and soaked in 70% ethanol for 1 h to 
permeabilize the cell wall. The slides were then rinsed in wash buffer 
(40% formamide, 2x SSC buffer). In all, 100 |il of hybridization 
solution (40 % formamide, 2 x SSC buffer, 1 mg/ml E. coli tRNA, 
200 |ig/ml BSA, 2 |iM ribonucleoside vanadyl complex, 2 |il of 2.5 |iM 
probe stock) was added to each coverslip, which was then covered 
with two 22 mm x 22 mm HybriSlip hybridization covers (Invitrogen) . 
The coverslips were enclosed in watertight aluminum slide chambers 
and hybridized overnight in a 30°C water bath. The next morning, slips 



were washed three times for 30min each in wash buffer warmed to 
30°C. They were then rinsed once in 2 x SSC and once in PBS, and 
finally mounted on glass slides with 40% glycerol. 



Microscopy 

After the appropriate preparation for the sample, the prepared 
coverslips were mounted on glass slides using 40% glycerol and, if 
appropriate, 5 |ig/ml DAPI as a DNA stain. Nucleoid, FROS, and mRNA 
imaging were performed using a Nikon Eclipse TE2000U microscope 
with an Applied Scientific Instruments PZM-2000 automated stage 
utilizing Metamorph automation software. Images were gathered 
using epifluorescent illumination of the appropriate wavelength with a 
X 100 phase contrast objective combined with a x 4 telescope 
attachment using a Roper Scientific Cascade: 51 2 camera. 

Single-molecule Lacl-Venus microscopy was performed using a 
Nikon Eclipse TE2000U microscope with either a 488-nm laser 
(10 mW) or a 515-nm laser to improve the signal-to-noise ratio for 
Lacl-Venus when needed, a TIRF microscopy attachment, and a Prior 
ProScan II automated stage using Andor iXon Basic software at x 133 
(for visual clarity in representative images) or x 400 magnification 
(for image averaging). Emitted light was collected through the 
appropriate filter (ET 535/30 M) and the image captured by an Andor 
DV877-DCS-BV electron multiplying charge coupled device (EMCCD) 
camera. Imaging of the nucleoid with the same TIRF microscope after 
staining with SYBR green I (Invitrogen) yields the same average DNA 
distribution as that obtained with epifluorescent illumination of DAPI- 
stained cells at all growth rates, indicating that the penetration depth of 
the evanescent TIRF wave is sufficiently deep to observe a volume 
representative of the entire nucleoid. 



Data and image analysis 

All data and images were analyzed using custom software written in 
MATLAB 2010a (Math Works). The distribution of Lacl-Venus in each 
figure is determined as the difference in average spatial distribution 
of fluorescence in a Lacl-Venus strain minus the average spatial 
distribution of autofluorescence of the same strain without the 
lacl-venus integration, grown under identical conditions. In particular, 
distributions obtained after the addition of chloramphenicol in 
Figure 5 were compared with samples of the negative control treated 
identically with chloramphenicol at each time point. Image analysis is 
discussed thoroughly in Supplementary Information — Image Analysis. 



LacZ measurements 

For measurements of repression strength, the cells were grown as 
described above, and LacZ activity measured using traditional 
methods (Miller, 1972). When the density of the culture reached 
00600 = 0-2-0.4, a 0.5-ml sample of culture was added directly to 
0.5 ml Z-buffer (Miller, 1972) and 40 [i\ chloroform (Fisher Scientific). 
Each sample was thoroughly mixed via repeated pipetting. In all, 
200 |il of 4 mg/ml ortho-nitrophenyl-p-galactoside (ONPG; Sigma- 
Aldrich) in Z-buffer was added. When sufficient yellow color had 
developed to be visible by eye, 500 |il of 1 M Na2C03 was added and the 
elapsed time recorded. OD420 and OD550 of each sample were 
measured in a spectrophotometer (Bio-Rad SmartSpec 3000) and the 
LacZ activity, a, was calculated as 



a=1000x- 



OD420- 1.75xOD550 



2xAfxOD600 
Repression strength, p, defined as 

p = a_/a + 



(9) 



(10) 

and shown in Figure 6 is the ratio of the average of four measurements 
from independent cultures of each strain with (a + ) or without (a _ ) lac 
repressor. Error bars are calculated as 



=(p)> 



(11) 
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where are the standard deviations of the four measurements of 
a + and a _ . 



Supplementary information 

Supplementary information is available at the Molecular Systems 
Biology website (www.nature.com/msb). 
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